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Abstract 



This work studies the capacity region of a two-user ergodic interference channel with fading, where 

oo 

is only available at the receivers. A layered erasure model with one-sided interference and with arbitrary 



fading statistics is studied first, whose capacity region is completely determined as a polygon. Each 
dominant rate pair can be regarded as the outcome of a trade-off between the rate gain of the interference- 
free user and the rate loss of the other user due to interference. Using insights from the layered erasure 



model, inner and outer bounds of the capacity region are provided for the one-sided fading Gaussian 
interference channels. In particular, the inner bound is achieved by artificially creating layers in the 
signaling of the interference-free user. The outer bound is developed by characterizing a similar trade-off 
as in the erasure model by taking a "layered" view using the incremental channel approach. Furthermore, 
the gap between the inner and outer bounds is no more than 12.772 bits per channel use per user, 
regardless of the signal-to-noise ratios and fading statistics. 

Index Terms 

Capacity region, channel state information, deterministic model, fading, incremental channel, inter- 
ference channel, layered erasure model. 
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I. Introduction 

The capacity region of Gaussian interference channels, comprised of one or more interfering 
links, remains open for more than thirty years. Etkin, Tse and Wang [1] have recently made an 
important progress by characterizing the capacity region of the two-user Gaussian interference 
channel to within one bit. Since then, several new results have been obtained, including the sum 
rate in special interference regimes [2], [3], the degrees of freedom of Gaussian interference 
channels [4]-[8] and MIMO Gaussian interference channels [9]. The capacity of fading interfer- 
ence channels has also been studied, e.g., in [10], [11], where the focus has been on scenarios 
where channel state information (CSI) is fully available at the transmitters as well as at the 
receivers. 

This work studies fading interference channels where the instantaneous channel state is avail- 
able at the receivers but not at the transmitters. This is the case in many practical systems where 
the channel state can only be measured by the receivers, which cannot inform the transmitters of 
the state accurately in a timely manner through a feedback link. Specifically, this paper assumes 
independent (fast) fading over time, where the fading statistics are known to the transmitters. 
Note that the result can also be applied to some situations where the transmitters are given an 
estimate of the channel state over a coding block, but cannot track its instantaneous variations. 
This study is different than the work of Raja, Prabhakaran, and Viswanath [12] on compound 
interference channels, where the channel state (from a finite set) is unknown to the transmitters 
but remains static over the course of a codeword. The key issue therein is to find a coding scheme 
which is simultaneously compatible with all interference configurations. The results of [12] are 
applicable to (slow) block fading interference channels with no CSI at transmitters. The current 
paper, however, investigates the ergodic case where the code is designed to average over all 
fading states. 

To make progress, this paper considers interference channels with two single-antenna users 
where the interference is one-sided, i.e., only one of the users is subject to interference from the 
other user. Note that the capacity region of such a channel, also known as Z-interference channel, 
is open even without fading. Such an interference model is suitable if one of the receivers is 
within the range of both transmitters, while the other receiver is out of the range of the interfering 
transmitter. One scenario for this case is a linear network of four nodes with information flow to 
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one direction, where every other node transmits, whose range covers both the intended receiver 
downstream, as well as the unintended receiver upstream. 

Like a number of recent works (e.g., [6], [13]— [16]), this paper makes use of the deterministic 
model approach to glean insights to good coding schemes for general interference models. Despite 
of its simplicity, the deterministic model captures two key physical phenomenons of wireless 
channels, namely, the broadcast nature of wireless transmission, and the superposition of multiple 
signals at the receiver. In particular, fading wireless channels can be simplified to a time-varying 
version of the deterministic model, where the state of a link corresponds to the number of most 
significant bits or not erased by noise. The capacity region of such a layered erasure model 
for two-user fading broadcast channel has been established by Tse and Yates, who then apply 
the insight to obtain a constant-gap characterization of the capacity region of the corresponding 
fading Gaussian broadcast channel [17]. 

The main contributions of this paper include: 

• The exact capacity region of layered erasure one-sided interference channel with arbitrary 
fading statistics is established. 

• Using insights from the converse result for the layered erasure model, a new outer bound 
for the capacity region of the fading Gaussian one-sided interference channel is obtained. 

• A specific coding scheme is shown to achieve a rate region to within a gap of 12.772 bits per 
channel use per user from the outer bound, regardless of the fading statistics, signal-to-noise 
ratios (SNR) and interference-to-noise ratios. 

It should be noted that recent results on the layered erasure model due to Aggarwal et al. are 
special case of the general result in this paper. These include the capacity region for uniformly 
very strong interference (Theorem 3 in [18]) and ergodic very strong interference (Theorem 6), 
and the sum-capacity for uniformly strong but not very strong interference (Theorem 4), uni- 
formly weak interference (Theorem 7) and a special class of mixed interference (Theorem 9). 

The remainder of this paper is organized as follows. The Gaussian fading channel with one- 
sided interference and the corresponding layered erasure channel are described in Section HD The 
main results in this paper are summarized in Section Hill In order to make our development more 
accessible, the capacity region for single-layer erasure channel is established first in Section [IV] 
and the development for the general case is relegated to Section |V] The result for the Gaussian 
fading model is found in Section |VlJ Conclusion is drawn in Section IVIII 
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II. Models and Notation 

Consider an interference channel with two pairs of transmitters and receivers, where the 
message of transmitter 1 is intended to receiver 1, and the message of transmitter 2 is intended 
to receiver 2. It is assumed that the interference is one-sided from transmitter 2 to receiver 1, 
so that the direct link of user 2 is free of interference. 



W Y 




(a) (b) 
Fig. 1. One-sided interference channels with fading, (a) A Gaussian model, (b) A layered erasure model. 



A. The Gaussian Model 

Let W, X, Y and Z denote the transmitted and received signals of user 1 and user 2, 
respectively. Consider the following input-output relationship over each time interval m = 

l,...,n: 

Y m = ^/S^e j6 ^W m + y^e j6 ^X m + U m (la) 
Z m = y/S^eP 62m X m + V m (lb) 

where (Si m , 6i m ) and (S 2 m, 6 2m ) denote the channel gain and phase of the two direct links, 
respectively, and (So m , 0o m ) denotes the gain and phase of the interference link from transmitter 2 
to receiver 1. Such a channel is depicted by Fig. |l(a)| For convenience, let the additive noise 
{U m } and {V rn } consist of independent circularly symmetric complex Gaussian (CSCG) variables 
with unit variance. Let the average power of each transmitted codeword be constrained by 1. 
The state S im can be regarded as the SNR of the corresponding link. Since CSI is not available 
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at the transmitter, the input signals and the state of the three links are mutually independent. 
It is assumed that the fading process for each link {Si m , 6j m } is independent and identically 
distributed (i.i.d.) over time m = 1, . . . , n, and that the amplitude process and phase process 
of each fading process are also mutually independent. Thus all 6n variables, S im , 9 im , with 
i = 0, 1, 2 and m = 1, . . . , n, are mutually independent. Furthermore, the phases 0, (i = 0, 1, 2) 
are assumed to be uniformly distributed on [0, 2ir). Finally, it is assumed that the fading states 
are known to the receivers but not to the transmitters. 

Note that we often drop the time index when referring to the statistics of an i.i.d. process over 
time. For example, Si is identically distributed as Si m for i — 0, 1, 2. 

B. The Layered Erasure Model 

In the spirit of the deterministic models introduced in [19], the layered erasure channel model 
for the one-sided interference channel is depicted by Fig. |l(b)| and described as follows. Let the 
signals emitted by transmitters 1 and 2 at the m-th time interval be denoted by W[m] and X[m] 
respectively, which take values in Ff, where F 2 represents the binary Galois field and F 2 denotes 
the g-vector space with underlying field F 2 . Let s denote a q x q matrix with s i+lji = 1 for all 
i = 1, . . . , q — 1 and all other elements being 0, so that s[xi, x 2 , . . . , x q ] T = [0, x\, . . . , £ g -i] T 
represents a single shift, and s n X[m] denotes a downward shift of the elements of the vector 
X[m] with its n least significant bits dropped out and n zeros padded from the top of the vector. 
The received signals at time interval m are then expressed as: 

Y [m] = s q ~ Nl [m] W [m] © s q - N ° [m] X [m] (2a) 
Z[m} = s q - N2[m] X[m) (2b) 

where {iVofm]}, {iVi[m]} and {iV 2 [m]} are integer random processes taking values in {0, ... , q}, 
which represent the fading states of the three physical links. Let {iVo[m]}, {iVxfm]} and {iV 2 [m]} 
be mutually independent, and each of the three processes be i.i.d. over time (so that the channel 
is memoryless). It is further assumed that the fading states are known to both receivers but not 
to the transmitters. 

We introduce the following notation for the layered erasure model for convenience. For a 
random vector X e F 2 , let X,i denote its i-th element and X{ denote [JQ, . . . ,X,] T . For a 
vector process X[l], . . . ,X[M], we use (Xi)f to denote the sequence Xi[l], . . . , Xi[k\, and use 
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(-X^)f to denote the sequence X\ [/],..., X\ [k\. The indexes outside the parentheses always 
refer to time. Binary addition of vectors of different length is aligned at the least significant bits; 
e.g., if m > n 2 , then X™ 1 © W\ 2 = [X x , . . . , X ni _ n2 , X ni _„ 2+1 © W x , . . . , X ni © W 7 "^] 1 ". Since 
the channel described by © is memoryless, we often suppress the time index to describe the 
model as: 

Y = W? 1 © Xf° 
Z = Xf 2 . 

Furthermore, the distribution of an i.i.d. sequence of random variables is often represented by 
the variable with the time index suppressed, e.g., Ni[m] are identically distributed as iVj. 



III. Main Results 

Throughout the paper, all information units are bits and all logarithms are of base 2. 
Theorem 1: The capacity region of the one-sided fading Gaussian interference channel (OQ) is 
contained in following region: 

0< R x <Elog(l + Si) 
0<i? 2 <Elog(l + S 2 ) 



K 



where 



and 



R 2 



Ri+uR 2 < l+Elog(l + 5i) +cjElog(l + ^^l 



Si+l J 

+/ >/3( 7 )-a( 7 )) + ^d 7 , Vwe[0,l] 



(3) 



So 



Si 



<1<S( 



/3( 7 ) = E 



P (S 2 > 7) - P 



Sn 



>7 



Si 



(4) 



(5) 



Furthermore, regardless of the fading statistics, the outer bound can be achieved to within a gap 
of at most 12.772 bits/s/Hz per user. 

Theorem 2: The capacity region of the one-sided layered erasure interference channel © is 

(I < /,', < EN 2 



C=l{Ri,R 2 ) 



q 



< Ri+uoR 2 < EiVi+wE [N - N x } + + > (uj(3i-ai) + , Vc^ G [0, 1] 



(6) 



i=i 
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where, for every I e {!,••• 



a, = P (iV - iYa < / < N ) 



(7) 



and 



A = E [P (iV 2 > - P {N - N t > TO] 



+ 



(8) 



It is clear that the rate region TZ and the capacity region C are each surrounded by a collection 
of simple affine constraints. As we shall see, this is due to the trade-off between the gain in the 
rate of user 2 and the loss in the rate of user 1 due to interference from user 2, depending on 
the signaling of the users. 

Note that in Theorem |2j setting u = in the second constraint in © yields the single-user 
bound < R\ < EA^ for the rate of user 1. This however does not apply to Theorem Q] because 
of the extra constant 1 in the third constraint of ©. 

In the subsequent sections, we first prove Theorem [2] in Sections [IV] and |VJ Insights developed 
from the capacity-achieving scheme for the layered erasure model are then adapted to prove 
Theorem \T\ for the fading Gaussian interference channel in Section [VI]. Because the proof of 
Theorem[2]for the general layered erasure channel is still quite involved, we first prove Theorem[2] 
in the special case of a single layer to illustrate the key ideas and techniques. 

We also note that in some special cases, the capacity region or sum-capacity of the fading 
Gaussian interference channel can be exactly characterized, which, however, are not implied by 
Theorem [TJ Relevant results are given in Section IVI-CI 



Assume a single layer, i.e., q — 1, throughout this section. Denote the erasure probability of 
the link labeled with N as and let e» = 1 — e, for notational convenience. Evidently, q is the 
probability that the input symbol actually traverses the corresponding link. The region C defined 
in Theorem [2] with q = 1 is quite simple, as is illustrated in Fig. [2] for all possible configurations 
of the parameters. The region is precisely described in the following proposition. 

Proposition 1: Let q = 1. In case of strong interference, i.e., e > e 2 , the region C defined 
in ([6]) is the pentagon with boundary constraints < R\ < e\, < R 2 < e 2 , and 



IV. Proof for the Single-layer Erasure Model 



Ry + R 2 < 1 - e oei , 



(9) 
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\ R\ + i?2 — 1 — £o £ l 

\ 



£o 

(■2 

eoei 



eieo ei 



Rl + i?2 — 1 — £o£l 




eieo ei 



(a) 



(b) 



(c) 



Fig. 2. Capacity region for single-layer erasure channel with different cases drawn by solid lines, (a) £2 < eoei. (b) eo > e2 > 
eoei. (c) ?2 > eo. 



which reduces to a rectangle when e 2 < e ei. In case of weak interference, i.e., e 2 > e , C is the 
pentagon with boundary constraints < R\ < ei, < R 2 < e 2 , and 

QJi Q!i 

fli + -^^2 < ei + ^oei (10) 
Pi Pi 

where a x = e ei and p\ = e 2 — eie . 
Proo/ 1 By © and ©, 

a 1 = P(iVo = l,iV 1 = l) = e e 1 

and 

Pi = E[P (iV 2 = 1) - P (JVo = 1, N, = 0|JVx)] + 
= eie 2 + ei(e 2 -e ) + . 

If e > e 2 , then «i > p\, so that the second constraint in © reduces to < Ri+uR 2 < Ii+ue ei 
for all to E [0, 1]. For every to, the upper bound passes the point (ei,e ei), hence the tightest of 
such bounds is the one with cu = 1, i.e., ©. 

If, on the other hand, e < e 2 , then ati < p\ = e 2 — eie , so that the second constraint in © 
becomes 

Ri + uR 2 < ei + cje ei + (wft - a x ) + (11) 
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for all ui G [0, 1]. For every uj e [0, Qi/ Pi], the upper bound becomes Ri + ujR 2 < ?i + uj^i, 
the tightest of which is (flOl) achieved at uj = a x /p x . For every u; G [ai//3i, 1], the upper bound 
becomes -Ri + co>-R 2 < e ei + we 2 . It is not difficult to see that all of these bounds as well 
as (TTOl) pass the point (e ei,e 2 ). Because R 2 < e 2 , the tightest of these bounds is still (flOl) . 
which describes the dominant face of the region C. ■ 
In the remainder of this section, it is shown that the region C described in Proposition \T\ is 
indeed the capacity region. The region is first shown to be achievable, and then a matching 
converse is established. 



A. Proof of Achievability 

Since X = X and W = W are scalars, and Ni = or 1, we can denote Xf by N { X and 
W^ 1 by N{W . In each sub-figure of Fig. [2l we shadow the pentagon region enclosed by the 
axes, the lines Ri = e x , R 2 = e , and R x + R 2 = 1 — e ei, which is the capacity region of the 
following multiple access channel (MAC): 

Y = NiW®N X. (12) 

Note that if an achievable rate pair (Ri, R 2 ) for channel © falls into the MAC capacity region, 
then the message from transmitter 2 can be decoded at receiver 1. With these in mind, we 
investigate the achievability for all two possible cases: 

If e 2 < e , C is contained in the MAC capacity region (see Fig. |2(a)| and Fig. |2(b)[ ). Let Ber (p) 
denote the Bernoulli distribution which puts probability masses of p and 1— p at values 1 and 0, 
respectively. Any rate pair in C can be achieved by using Ber (1/2) inputs and letting receiver 1 
decode messages from both transmitters. 

If £2 > £0, it suffices to show that the two corner points (ei,eoei) and (eoei,e 2 ), which are 
marked with star and square in Fig |2(c)[ respectively, are achievable. Because the point (ei, e ei) 
is also a corner point of the MAC channel capacity region, it can be achieved. To achieve the 
second point, both users can use random codebooks generated according to Ber (1/2) distribution. 
Let the code rate of user 2 be e 2 . Note that if the fading state (N , Ni) = (0, 1), then Y = W; for 
all other realizations of (No, Ni), Y is independent of W. Therefore, from receiver l's viewpoint, 
this is equivalent to an erasure channel with erasure probability 1 — e ei. Thus the rate e ei is 
achievable by user 1, which shows that the corner point (e ei, e 2 ) is also achievable in this case. 
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To assist the study of general cases, it is helpful to further investigate the achievability of 
star point in Fig |2(b)[ whose coordinate is (e 2 — eoei,e 2 ). In particular, the coding scheme can 
be constructed explicitly using the rate splitting method [20] as follows. We split user 1 into 
two virtual users: one encodes its message through random coding with random variable U ~ 
Ber(<5/2), and the other encodes its message with random variable V ~ Ber (1 — 1/(2 — 8)), 
where 5 G [0, 1]. Let W = max(U, V). Clearly, P (W = 0) = P (U = 0) P (V = 0) = 1/2 so 
that W[l), . . . , W[n] is an i.i.d Ber (1/2) sequence. Also let user 2 generate its codebook using 
i.i.d. Ber (1/2) distribution. Receiver 1 can first decode the message encoded in U at rate 

R U (8)=1(Y;U\N 1 ,N ) 

= P(N 1 = 1)P(N = 0)1(W;U) 
= e 1 e (H(W)-H(W\U)) 



eie 



-n 



2-5 



then it is able to decode the message of user 2 as long as the rate does not exceed 

R 2 (5)=I(Y;X\U,N ,N 1 ) 

= P (N = 1) P (JVi = 0)H (X) + P (iV = Ni = 1)2 (W © X; X\U) 
= e e 1 + e ex [H ( W ® X \ U) - H (W\U)} 



eoei + eoei 



5. 



-n 



i 



2-5 



and finally decode the message encoded in V at rate 

R 12 (6)=I(V;Y\U,X,N ,N 1 ) 



ZxliV-Wp) 



Note that the rate for user 1 is R\(5) = Rn(5) + R%2{5) and it is easy to verify that Ri(5) + 
i?2(5) = 1— eo£i. Furthermore, as 5 varies from to 1, the point (Ri, R 2 ) moves from (ei, e ei) to 
(eie ,e ). Since e > e 2 > e ei, there exits some 5* such that (Ri(5*), R 2 (5*)) = (e 2 — e ei,e 2 ). 
Meanwhile, with i.i.d. Ber (1/2) input (X)™, rate e 2 can be achieved at receiver 2. Therefore, 
the corner point (e 2 — e ei, e 2 ) can be achieved using rate splitting if e > e 2 > e ei. 
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B. Proof of Converse 

Every achievable rate pair (Ri,R 2 ) must satisfy R 1 < Z\ and R 2 < e 2 . Therefore, it suffices 
to show that the rate pair must satisfy © in the strong-interference case where eo > e 2 , and 
must satisfy (flOl) in the weak- interference case where eo < e 2 . 

It is easy to see that, because the two decoders operate independently, the capacity region 
of the one-sided interference channel depends only on the marginal distribution of the channel 
outputs conditioned on the inputs, but not on the joint conditional distribution [21]. It is assumed 
in the remainder of this section that the random variables iVo[m] and N 2 [m] are "aligned" such 
that P(JVo[m] ■ N 2 [m] = 1) = min(e"o,e 2 ) for every m, whereas the state variables {iVi[m]} 
remain independent of {iV [m], iV 2 [m]}. Clearly, if the realization of the weaker one between 
iVo[m] and N 2 [m] is equal to 1, then the realization of the stronger one must also be equal to 1. 
This does not change the capacity region. It is important to note that the alignment does not 
change a>i and Pi either because they depend only on the marginal distributions of N and N 2 . 

Consider first the case e > e 2 . For notational simplicity, let N = (N , Ni, N 2 ) so that (iV)" 
denotes all fading states from time 1 to time n, i.e., (2V)™ = {Ni\j] : i = 0, 1,2, and j = 
1, . . . , n}. By Fano's inequality, Ri must satisfy 

nRi -nd n <l (Y[l\, Y[n}; W[l], W[n]\(N)^) 

=w(o r )?iw?)-w((n?i(w)?,(i>o?) 

<n(l-e e 1 )-H((N X) 1 l\(N^) (13) 

for some 5 n vanishingly small as n — > oo, where (fT3l follows from that Tt ((Y)^ |(JV)") is 
maximized by setting both (W)" and (X)™ to be i.i.d Ber(l/2) sequence. Also due to Fano's 
inequality, R 2 must satisfy 

nR 2 -nS n < I((Z)?;(X)?|(JV)?) 

= H((N 2 X) n 1 \(Nr i ) . (14) 

Note that e > e 2 by assumption, so thatiVo > N 2 , and, thus H ((N X)^\(N)^) > H ((iV 2 X)™|(iV)?). 
Comparing (TT31) and (TT41) yields © as n — > oo. 

Next, consider the case of e < e 2 . Let (W)^ = (W[l], . . . , W[n\) be an i.i.d. Ber(l/2) 
sequence independent of all channel inputs and channel states, and let Y = N%W © N X. 
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Fano's inequality requires that 

nR\ — n5 n 

<!((¥)-; (W)JI(JV)J) 

= H {{N X W © iVoX)J|(JV)J) - W ((iV X)?|(JV)?) 
<W ((iW © iW © iV ^)i I Wl)- H {(NoXftKN)?] 
=H {[N X W © N Q X)l\{N)l]- U {(NaX)?\(N)?) 

= i ( (y)» ; (iv)f 



(15) 
(16) 



where (fl5T) follows from data processing theorem and (PT6l) is due to the fact that © is 
identically distributed as W. Breaking down the mutual information in another way, we obtain 

nR\ — n5 n 

< H ({N x W)l (N)f \ - H ({NiW)™ (1% (JV)J 

= nc 1 -w((JV 1 W)j|(K)?,(JV)?) 
= nci-w((AbA')?|(?)? l (JV)j) . (17) 

Also by Fano's inequality, 

<1((N 2 X)1; (X)?|(JV)i) 

<j((wr)?,(y)? ; (x)?|(N)y 
= j (x) j I (jv)?) + x ((iv 2 x)- (y)?, w? 

<ne ei+^(> 2 X)? (?)?, (JV)J 



(18) 

The upper bounds (flTT) and (fT8l) can be understood as follows: The rate pair (ei,e ei) can be 
achieved by letting user 1 decode and cancel the signal of user 2 as shown in Section IIV-AI 
By choosing the signaling X, user 2 can improve his/her own rate by H MiVaX)" , (JV)"^ 
at user l's rate expense in the amount of 7i ((NqX)™ (V)", (JV)iJ due to interference. In the 
following, we consider the trade-off between the rate loss of user 1 and the rate gain of user 2 
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over all choices of the signal (X)™. Using Marton-like expansion [22], [23] and the chain rule, 

H ((iV 2 X)?|(lX (JV)?) - n ((iv x)?|(>% (jv); 

n 

= E ((WOi, (tv x)? +1 |(y)?, (jv)?) - w ((W, (#oX)?|(i% (AO?) } 
i=i 

n 
i=l 

-w ((iv 2 x)r\(iVoX)« +1 |(y)?,(Ar)?) - w ((iv x) i |(iv 2 x)r\(iVoX)r +1 ,(y)?,(Ar)?)} 

n 

= E ((^)il(^aJOi- 1 , (AT)") 

i=l 

-W ((iVoXJ^JVaX)*" 1 , (iV X)? +1 , (?)» (AT)?)} • (19) 



iV X © iVW, T, AT . 



— H N X 

Proof: Since there is no conditional uncertainty about iVoX unless iVo = Ni 



H (N X 



The expression ([191 ) concerns only the conditional entropy of single random variables. To 
bound (TT91 , we need following lemma: 

Lemma 1: Assume alignment of N and X 2 . Let T be a collection of random variables which 
are independent of N = (N , Ni, N 2 ). Let W be a Ber(l/2) random variable independent of 
X. Then 

'n x®n 1 w,t,n) =h(x\t) = ^h(n 2 x 

ibc 

X X © NiW, T, N) = P (iVi = 1, iV = 1) H (X\T) 

= a{H(X\T) . 

To see the second equality, write 

H (N 2 X N X © NiW, T, 

= P (N = Nx = N 2 = 1) H [X X © W, Tj + P (N = 0, N 2 = 1) H (X\T) 
= P (N = N 2 = 1) P (Ni = l)H (X\T) + P (No = 1, X 2 = 0) W (X\T) . 

Since X and N 2 are aligned, P (JV = N 2 = 1) = min(e , e 2 ) and P (JV = 1, N 2 = 0) = (e 2 - 
e ) + . Note also that e,\ min(e 2 , e ) + (e 2 — e ) + = e{e 2 + ei(e 2 — e ) + = (3\ with assumption of 
e 2 > e . Hence the proof of Lemma [TJ ■ 
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Now back to the proof of the converse result. For each i, we apply Lemma [T] to (fl9l ) with 

T t =({N 2 X)\-\ (iV Q X)r +1 , (Y)\-\ (Y )2 ¥1 , (N)\-\ (N)l +1 
which is independent of (iV)t. In particular, 

n ((iy x) 4 |(iv 2 x)i-\ (N x)? +1 , (r)?, W? 

= H({N Q X) i Tj, (A/"); 



nUN 2 X)i (Y)i,Ti,(N), 



Oil, 

Ji 

^-H (^(N 2 X) i ^(N 2 X) i f 1 , (iVoX)« +1 , (y)» (AT)" 



By CGI and ([20] 

ft ((N 2 X)i (¥)'[. (N)'l ) - 7Y ( (A'o.Y)',' (Y)".(N)" 



(20) 



= E ( X - f ) W ((N2X) t \(Y) l} T u (AT)*) 

^ E ( x - f ) w ((^) i |(iv 2 x)r i , (5% 



1 - ^ ) W ( (Y)l (JV)" 



(21) 



(22) 



where (|2T|) is due to the facts that conditioning reduces entropy and p\ > cui when e < e 2 , and 
(1221) is by the chain rule. We rewrite (1221) as 



n ( (JVoX)? (*% (at)« )>fn[ (N 2 X)l (y)» (AT)" 



(23) 



Comparing ([HI) . (fl8l) . and (El), we have 



ni2i + ^n_R 2 - n5 n - ^-n5 n < e x + ^e ei. 
Pi Pi Pi 



Sending n — > oo yields (TTOT ). 



V. Proof for the General Layered Erasure Model 

In this section, we prove Theorem [2] in full generality, using insights obtained in the proof 
for the single-layer model. We begin with the converse. 
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A. The Converse Part of Theorem [2] 

Because the capacity region depends only on the marginal distributions of the received signals, 
we can assume arbitrary joint distribution of the fading coefficients iVo and A^ as long as the 
marginals remain the same. Throughout the proof of the converse of Theorem |21 (Section IV-AI) . 
it is assume that {Ax[m]} is independent of {jVo[m], A^m,]}, and {iV |m]} and {Af 2 [m]} are 
aligned as described in the following. Let Fjv(n) — P (N < n) denote the cumulative distribution 
function of an arbitrary random variable N, and define its inverse as F^ 1 ^) = inf{w : F N (u) > 
t}. Let A be a uniform random variable on [0, 1], then F^ 1 (A) is identically distributed as N. 
Let 

N [m] = F^(A[m]) , and N 2 [m] = F^(A[m\) , m = 1, 2, . . . (24) 

where {A[m]} are i.i.d. and uniformly distributed on [0,1]. Basically, once aligned, a larger 
realization of N implies a larger realization of A" 2 , and vice verse. Note that, unlike in single- 
layer case of q = 1, there is no guarantee that one of the fading coefficients dominates the other 
(e.g., N = 1 but A" 2 can take values of and 2). Therefore, the layered interference channel 
cannot always be categorized as a strong or weak interference channel. 

It is important to note that ai and (5u I — 1, . . . , q, as well as the region C, remain unchanged 
after the alignment. In particular, only the marginals distributions of N and N 2 are used in the 
definition of Pi. 

Let the elements of (W)™, i.e., Wi\j], i = l,...,q and j = l,...,n, be i.i.d. Ber(l/2) 
random variables. Also, (W)™ is independent of all channel inputs and states. The following 
generalization of Lemma \T\ holds. 

Lemma 2: Suppose Ao and A^ 2 are aligned according to ([24]) . Let T be a collection of 
random variables independent of N = (N , Ni, A" 2 ). Let X be an arbitrary random vector 
in F2 independent of W. Then 

q 

H I Xf° © w[ 1 ,T,n)=J2 a{H (X t \ X'f \ T) (25) 



1=1 
q 



H (Xf 2 1 Xf° © W* 1 ,T,N)=Y^ frH (Xi I X[-\ T) (26) 

1=1 

where a; and $ are given by © and ([8]), respectively. 

The proof of lemma [2] is based on direct computation. For details, see Appendix H It is 
interesting to note that the expression of (3i given in © is independent of the correlation between 
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N and N 2 , although the derivation depends on the alignment between them. Therefore, the 
expression of capacity region © does not depend on the artificial alignment either. 

An interpretation for Lemma El is as follows. Suppose that we can observe X through three 
channels: Xf°, Xf 2 , and Xf° © W*\ By ©, a x is the probability that layer / of X can 
be seen in X t but not through the channel X 1 © W 1 . Whenever this event happens, the 
amount of entropy H (XjjX^ 1 , T) is accumulated (via chain rule). Hence, ([25]) follows. Similar 
interpretation can be obtain for (|26l) by noting that is the probability that layer / of X can 
be seen in X 1 2 but not through the channel X 1 © W 1 under the assumption of alignment 
between N and iV 2 . 

Equipped with Lemma[2l we next establish the constraints on the capacity region in Theorem[2] 
The first constraint in © is trivial by cut-set bound. The second constraint for Ri + ujR 2 is 
proved in the following. 

~ —Ni N 

For notational convenience, let Y = W 1 © X 1 °. By Fano's inequality, 

= H((W? 1 ©Xf°)?| (AO?) -ft((Xf°)?| (AO?) 



-JVi 



< n ( ( w _ r © 1 © x j ; 



A0?)-ft((xf°)?|(A0?) 



-JVi 



w (w\ ©xf°)? (ao? -ft((xf°)?|(A0?) 



(27) 



- JVi 

-JVi 



(AO? 
X( 



1 



1% (at)" 



ATi, 



[Y) n i, W? 



(28) 



-iVi 



where (f27l) is because VF^ 1 © W x is identically distributed as W x 
Also by Fano's inequality, and by the chain rule of mutual information, 

nR 2 -nS n <l( (Xf 2 )?; (X)?| (AT)") 

- 1 [ (Xf 2 )?, (Y) n i, (*)? W? 



J ( (V)?; (X)? (AO? ) + J ( (Xf 2 )- (X)? (Y)?, (AT)? 



J ( (F)?; (X)? (N)i )+H( (Xf 2 )? (F)?, (AT) 



(29) 
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By the property of memoryless channels, the first term on the right hand side (RHS) of (1291) can 
be upper bounded: 

n 

1 ((F)?; (X)?| (JV)?) < E J ((F),;; (X),|(iV)?) 

1=1 

n 

< J2 n {<r)*\ w") - w ((^r 1 ),! w? 

i=l 

< nEmax(JVi, iV ) - raEJVi 

By dlU) and (O, we have 

nR 2 - n5 n < nE [N - N,] + + H ( (Xf 2 )? (Y - )", (iV)™ 



(30) 



(31) 

Note that (1281) and (f3TT) have the same interpretation as (TT71 ) and (fT8b . respectively. By (|28l ) 
and (I3TT) . we have the following weighted bound for every u; G [0,1], 

+ nwifc - (1 + uj)n5 n < nEiVi + nuE [N - iVi] + 

- H f (Xf )? (Y")i, (2V)?) + f (Xf 2 )? (Y-)J, (JV)?) . (32) 



Similar to what was shown in the case of q = 1, we use a "Marton-like" expansion to write 
the difference of the two entropies on the RHS of (1321 in the special case of cu = 1: 



n ( (xf 2 )?|(i% (N)ij - n [(x»Ti (*% (jv)? 

n 

E ((^ a )i. (*f°)?+i|(*% Wi) 

1=1 



-w (jxHr\(*r)? cn?,(JV)y 

n 



+ w ( (xf ), (xr)r 1 , (xr )r + i, on?, w? 



-2V 2 \i-l 



- w ( (xf)i- 1 , (Xf 



~iVo\n 
>i+l 



H ((Xf°),|(Xf )\-\ (X^iYK, (AT)?) } 



^{^((Xf 2 ), (Y)i,T,,(JV)i) - (Y),.T,(iV), 

i=l 



(33) 
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where 



r, = ( (x^)\-\ (xf°)r +1 , onr 1 , (y)r + i, mr\ wr+i 



(34) 



which is independent of (2V)». Note also that 

ft((Xf 2 )?|(i%(iV)^ 

n 

8=1 
n 

>^H((xf 2 ),|(n,T,,(Ar), 



i=l 



(35) 



Therefore, by (|33l) and d35l) . the difference on the RHS of (|32l) for any uj e [0, 1] can be written 

as 

=w((xf ra )j|(y)j,(jv)?) )? (i%(jv)? 

-(l-u;)w((Xf 2 )? (r)?,(JV)? 

n 

<£{w((xf i ) i |(Y-) 4l r <l (jv) < , )-w( r (jrf'») < m,.T,. (An. 

n 
i=l 

n 

= £ {u,W ((Xf 2 ),|(r),„ Tj, (N)i^j - H ((X? 

i=l 

Applying Lemma |2] to both entropy terms with 7\ defined in (|34l ) yields 
luH {(X^) n i\ (Y)i, Wi) " W f(*f°)i (*% (JV)? 



r i9 (jv), 



:n,^,(iv),, 



n q 



i=i 1=1 
<n^(o;A-^) + - 



^1 l )i,Ti 



(36) 



!=1 



Therefore, substituting (l36l) into ((32l) and noting that 5 n — > as n — > oo, we have established 
the converse part in Theorem [2] 
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E[7V -iVi] + 




Curve C 








Fig. 3. An illustration of the capacity region for general layered erasure channel. The region is generally enclosed by the axes, 
line R2 = EJV2, and a piece- wisely linear curve C. The top maximum-sum-rate point is marked by square, whose position 
variates for different cases. 



B. The Achievability Part of Theorem [2] 

Let us first investigate the geometry of the region C given by ©. Assume EA^i,EAT 2 ^ 0; 
otherwise, the capacity region is trivial. The region bounded by the second constraint in © can 
be viewed as r\, e [o,i]-ff(u;) n [0, +00) 2 where 



and B{uj) = {I G {1, - • • , q} | u(3i > a{\. Let us order {a\J (3\ \ I — 1, . . . , q} as u\ < ■ ■ ■ < 
ujb < 1 < ojb+i ■ ■ ■ < and let the corresponding permutation be referred to as r so that 
k>t — olt{i)I ' f3 T {i), i = !,•••,<?• In addition, let lu = 0. It turns out that except for the b + 2 
constraints H(u>k), i — 0, . . . , b, and H(l), all constraints H(u) with other uj are redundant: 
Proposition 2: 



H{u) = I (R u R 2 ) R x + ujR 2 < EA^i + luE[N - N t ] + + ^ - aj) 





(37) 



we[o,i] 
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Proof: For every uj 6 [0, 1], H(u) is a half plane to the left side of a straight line. For 
i = 0, . . . , b — 1, the boundary of H(u>i) and the boundary of H(uj i+ i) intersect at the following 
point: 



which is denoted by L f from now on. In particular, this is because for layer I in B{uj i+ i) but not 
in B(u>i), we have uj i+1 (3i — ai = 0. In addition, let L b be the intersection point of the boundary 
of H(ujb) and the boundary of H(l), whose coordinate is also given by d38l) by changing the 
subscript i to b. Define intervals f2j = (u>i, uji + i) for i = 0, ... ,b — 1, and ^ = (o;&, 1). For 
every i = 0, . . . , b, it is not difficult to see that B(o;) = i?^) for all u; G fij. Furthermore, the 
boundary of H(u) also passes point Lj. Thus we see that the constraint H(uS) is redundant to 
H(u>i) and H(cu i+ i) (or for i = b). Hence the proof of the proposition. ■ 

In this one-sided interference channel problem, the parameter uj can be interpreted as a 
preference between rate loss of user 1 and rate gain of user 2 in view of (|28l ) and (|3TT) . This is in 
contrast to the layered erasure broadcast channel problem studied in [17], where the role of the 
weighting parameter in a weighted sum-rate characterization of the capacity region is interpreted 
as a preference between the two users. 

In general, the upper bound of the second constraint in ©, henceforth referred to by the 
boundary C, is piece-wisely straight, and its corner points are L , . . . , L b . Note the point L i+1 is 
always on the upper left side of Lj. In case of degeneracy, the two points coincide. The region 
C is the region enclosed by C, the line R2 = EiV 2 , and the two axes. The line R 2 = EA^2 can 
intersect with the curve C at various positions, and we call the intersection the top maximum- 
sum-rate point for obvious reason. To prove the achievability of the region, it suffices to show 
that the top maximum- sum-rate point and all corner points Lj below it are achievable. 

1) The Corner Points L , . . . , L b : For each i £ {0, . . . , b}, consider the achievability of the 
corner point which is below the top maximum- sum-rate point. Let user 1 generate a random 
codebook of rate 




(38) 
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and let user 2 generate two codebooks: one is for private message at rate 

R2 P = J2 P ( N *> 1 ) (39) 

the other for common message at rate 

R2c= Yl P(JVo-JVi>0 (40) 

where U(uS) = {1, . . . , q}\B(u) for every uj E [0, 1]. For notational convenience, for a random 
vector X G Fj and a subset of {1, . . . , q}, A, we denote X^ as a g-dimensional vector whose 
Zth element is X; if I E A and equals to 0, otherwise. All codebooks consist of i.i.d. Ber(l/2) 
entries. The codeword of user 1 is transmitted as W. The codeword for the private message of 
user 2 is transmitted as Xqi^a using the layers in B(ui), whereas the codeword for the common 
message is transmitted as Xu( Ui ) using the remaining layers. 

Note that, by ©, we can also write oti = E[P (N > I) - P (JV - N x > l\Ni) ]. Comparing 
with ([8]), we find that for every I E B(uJi), Pi > an since uji < 1, so that P (N 2 > I) > 
P {No > I) > P (N -N> l\Ni), which implies that A = P (N 2 > I) - P (N -N> I). 
Therefore, by ([39]> and (|40)>. 

R2c + R2 P = J2 p ( iV 2>0+ Yl p ( N o-Nt>l) 

= 5>(JV o -iV 1 >0+ P(iV 2 >/)- ^ P(JV o -JVi>0 

1=1 l€B(ui) l€B(ui) 

= E[N - N} + + A ■ 

Thus by (1381) . the codebooks carry exactly the rate pair at Lj. Therefore, to show achievability 
of point Li is equivalent to show that the rate triple (i? l5 R 2p , R-ic) is achievable. Indeed, receiver 1 
can decode the common message, because 

I (W^ © Xf°; X M(wi) |JV , iVi) = ^ I {W^ © Xf°; X|iV , N ± ) 

= P(^Vo-JVi>0 

ze«(u>i) 

= -R2c • 
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Intuitively, for given N = n and N% = n 1 , the signal X\ at level / in IA{uj^) contributes to the 
mutual information if and only if n — n\ > I. After canceling the interference caused by the 
common message, receiver 1 can decode its own message, because 

1(W^®X^-W\ X u(Ui) , N u N ) 

= H (Wf 1 \N U N )-H{ I Wf 1 © Xf°, X M(Wi) , iV 1; iV ) 
= EJVi - W (xf° | Wf 1 © Xf , X M(wi) , JV a> iV ) 

= EiV\- ^ at (41) 

where (|4T|) can be regarded as a consequence of Lemma [2l Receiver 2 can decode its private 
message because 

1(X^;X BM \N 2 ) = J2 ^(Xf ;X,|iV 2 ) 

= -R2p • 

Receiver 2 can also decode the common message: 

l{X^-X u[Ui) \N 2 ) = P(N 2 >1) 

= EN 2 - R 2p 
> R 2c 

where the inequality is because Li is below the top maximum- sum-rate point, i.e., R 2p + R 2c < 

EN 2 . 

2) The Top Maximum- sum-rate Point: We establish the achievability of the top maximum- 
sum-rate point in all three possible cases depending its position. 

Case 1: The top maximum- sum-rate point is below L . In this case, E[iVo — N\] + > EN 2 , so 
that the region © becomes rectangular. It suffices to show that (ENi,EN 2 ) is achievable. Let 
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user 1 and user 2 each generate a random codebook with i.i.d. Ber(l/2) entries, with rate EiVx 

and EN 2 , respectively. Then receiver 1 can decode the message of user 2, because 

i 

1 (W^ © X^XlNi, N Q ) = Y, 1 ^! 1 © X i°-> X i\ N ^ N o) 

i=i 
q 

= ^2P(N -N 1 >1) 

1=1 

= E[N - JVx]+ 
> EN 2 . 

After canceling the interference, receiver 1 can decode its own message, because 
1(W^ ©Xf°; W\X,N U N ) =I(W? 1 ;W\N 1 ,N ) 

= EiVi. 

Also, receiver 2 can decode its own message because 

1(Z;X\N 2 ) =H(X^\N 2 ) 
= EN 2 . 

Case 2: The top maximum- sum-rate point is between L and L b , i.e., the intersection of line 
R 2 = EA^ 2 and the curve £ is on the boundary of H(uo k ) for some k G {1, . . . , b}. The basic 
idea of the coding scheme is to transmit a private message using layers r(l), . . . , r(k — 1) and 
part of layer r(k), and to transmit a common message using layers r(k + 1), . . . , r(q) and the 
remaining part of layer r(k). 

Let user 1 generate a random codebook with i.i.d. Ber (1/2) entries. Define set B' = {r(l), . . . , r 
1)} and set W = {r(k + 1), . . . , r(q)}. Let user 2 encode its common message onto (X u >, U), 
and encode its private message onto (X&,V), where U and V are two binary signals. The 
transmitted signal X then consists of X u >, X B > and X T ^ = max(C/, V). The codebook for 
the common message consists of random independent entries, where the elements of X u > are 
Ber (1/2) and U ~ Ber (5/2) for some 5 E [0,1]. The codebook for the private message is 
generated similarly, but with V ~ Ber(l — 1/(2 — 5)). We note that 

H(X T{k) \U) = P(U = 0)H(V) 
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For fixed 5, the common message can be decoded at receiver 1 as long as its rate does not 
exceed 

R 2c (5) = X ( © Xf°; X u >, U\ N u N ) 



g 

j-a<Af -Afi) (1-W(X,|X M /,C7)) 

i=i 

E[N - N,] + - E{l (T{k) < No _ Nl) n (X T{k) \U) } - E 1 (2<iVo-jVi) 



= E[iV - iV\] + - P (iV - Ni > r(k)) (^t) - 1] P (#o - m > I) 

where (1421) is used to reach the last equality. Once the interference caused by the common 
message is removed, receiver 1 can decode its own message at rate 

Ri(5) = X ( Wi 1 © Xf; | X w , U, Ni, N ) 

= H (W? 1 |JVi) - W ( VFf 1 1 Wf 1 © Xf°, X w , 17, iVi, N ) 

= ENi - H (Xf° | Wf 1 © Xf°, X u , U, N u No) 

= EiV\ - a T{k) H (x T(fc) |t/) -J^W ( 43 ) 

leB 1 

2-5 ( 1 \ v-^ 
= EiV a - a T(k) —H f — - ) - £ «, (44) 

where (|43T) follows by Lemma [2l 

Let 5 be such that the common message of rate R 2c {5) is decodable at receiver 2. Once the 
common message is canceled, then the following private rate is achievable 

R 2p (5) = X (Xf 2 ; Xg', F| tf, N 2 ) 

= p > + p (n 2 > r(k)) 2 -^n ■ ^ 

leB' ^ ' 

It suffices to show that we can make (i?i(<5), R 2p (S) + R 2c (5)) coincide with top maximum-sum- 
rate point by choosing some 5. Note that 

R 2 (5) = R 2c (6) + R 2p (5) 

= E[N - N,] + + P T (k)^n (^—) +J2 & ■ (46) 

^ ' leB' 
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Multiplying (1461) with uo k and adding with (1441) . and noting that u k /3 T ( k ) = a T f k )> we have 

R 1 (5)+uj k R 2 (5)=EN 1 + uj k E[No-N 1 } + + ^ M - a{) (47) 

which is exact the equation of boundary of H{uj k ). Therefore, (Ri(5), is on the boundary 

of H(uj k ) and as 5 varying from to 1, point (Ri(5), R 2 (5)) continuously goes from point L k to 
point Lk~\. There must exist some 5* such that (Ri(5*), R 2 (5*)) is the top maximum- sum-rate 
point. 

Case 3: The top maximum- sum-rate point is above L b , i.e., the intersection of line R 2 = ~EN 2 
and curve C is on the boundary of H(l). The basic idea is to split the message of user 2 into 
a common message and a private message as in case 2; and regard user 1 as two virtual users 
in order to exploit rate splitting. 

Let user 2 transmit its private message using Xbcv) an d transmit its common message using 
Xu(i)- Define random vectors U, V E F|, where the elements of U are i.i.d. Ber(<5/2), and 
the elements of V are i.i.d Ber(l — 1/(2 — 5)), where 5 E [0, 1]. We split the user 1 into two 
virtual users, with codewords U and V, respectively. The transmitted codeword consists of 
Wi = max([/i, Vi), i = 1, . . . , q. 

Let receiver 1 decode U first, then decode by removing U, and finally decode V by 

removing X u ^ further. For fixed 8, following rate triple is achievable at receiver 1: 





= x 






R2c(S) 


= i 






RiM 


= x 







iVi,iV 

C/,iV 1; iV 



At receiver 2, the following private rate is achievable: 



R2 P {&) = i [x^ 2 ] X B {\) 



N-, 



as long as 5 is chosen such that the rate of the common message satisfies R 2c (S) + R 2p (5) < EN 2 , 
so that the common message can be decoded first at both receivers. It then suffices to show that 
there exists S* E [0,1] such that (Ri{5*), R 2 {6*)) = (Ri,i{6*) + R 1j2 (S*) , R 2c (S*) +R 2p {5*)), 
coincides with the top maximum- sum-rate point. 
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By the chain rule, 

R^S) + R 2 (6) = R hl (S) + Ri, 2 (5) + R 2c (5) + R 2p (5) 

= 1 (wf 1 © Xf°; U, V, Xuw\N u iV ) + X (xf 2 ; X B(1) 
= J (V f 1 © Xf ; W, X U (i) | N, , iV ) + J (xf 2 ; X B(1) 
= J (Wf 1 © Xf°; w|jVi, iV ) + J (wf 1 © Xf°; X w(1) 

+ j(xf 2 ;X B(1) |iV 2 ) 
= EfJVx - iV ] + + P ( N ° - + P > /) 



iV 2 
iV 2 ) 
W,N!,N 



ieu(i) 



ieB(i) 



= E[JVx - iV ] + + EiV - ^ P (iV > /) + ^ P (iV 2 > /) 

266(1) ZeB(l) 

= EiVj + E[N - N ± ] + + J2 (A - "0 • ( 4g ) 

ieB(i) 

Thus, (R 1 (5), R 2 (5)) is on the boundary of Note that -R 2 (<5) increases as 5 increases 

because larger 5 indicates larger part of user l's signal is removed before decoding the common 
message at receiver 1. Let 5=1, 

Rail) = ^ P (iV > /) + ^ P (iV 2 > /) 
l&t(i) ieB(i) 

>X)P(iv 2 >0 

1=1 
= EN 2 

where the last inequality follows by the fact P (N > I) > P (N 2 > I) on I e W(l). Since both 
and -R 2 (5) are continuous function, there must exist a 5* such that (Ri(5*), R 2 (5*)) is the 
top maximum- sum-rate point L b , which falls on the line segment between (^(0), -R 2 (0)) and 
(i?i(l), R 2 (l)). This completes the proof of Theorem [21 



C. Examples 

Before the end of this section, we investigate some special cases for the layered erasure 
channel with one-sided interference. 
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R 2 



> . 



(49) 



7 J 77ze Case of Stochastically Strong Interference: P (iV > /) > P (JV 2 > f° r every / G 
{1, . . . , q}. This implies that oj(3i < on for every I E {1, . . . , q} and every c<j G [0, 1]. Therefore, 
the second constraint of © can be simplified as H(0) D H(l). Hence, the capacity region with 
stochastically strong interference can be simplified to 

< Ri < EN t 
< R 2 < EN 2 
R x +R 2 < Emax(iVi,JVo) 
This generalizes Theorems 3-5 in [18]. We also notice that this result has been essentially 
established in [24] as a special case. 

2) The Case of Stochastically Weak Interference: P (iVo > /) < P (A^ 2 > /) for every / G 
{1, . . . , q}. Therefore, > a t for all / G {1, . . . , q}. The capacity region can be represented by 

' < /,'; < EiVi 

< R 2 < EN 2 

Ri + uj k R 2 < ENi + cj fc E(iVo - ^i) 
Furthermore, the sum capacity is 



(Ri, R 2 ) 



k E {l,...,q} 



(50) 



C sum = + E[iV - N,]+ + J2(p t - 



an 



= E m&x(N , Ni) + EN 2 - EN 

which is a generalization of Theorem 7 in [18]. 

3) The Case of Pure Deterministic Model: N = n , Ni = n\, and N 2 = n 2 are known to 
both the transmitter and the receiver. If n > n 2 , it falls into the case of stochastically strong 
interference. If n < n 2 , it falls into the case of stochastically weak interference where — 1 
for I G {1, . . . , (n 2 — n )} and u(l) = otherwise. Therefore, by simplifying (|49l) and (|50l ), the 
capacity region becomes 

< Ri < n x 

(Ri,R 2 ) 0<R 2 <n 2 > (51) 

Ri + R 2 < max(ni,n ) + (n 2 - rii) + 
which follows previous results on the capacity region of deterministic interference channels 
in [6], [25]. 



November 5, 2009 



DRAFT 



28 



VI. One-sided Fading Gaussian Interference Channel 

In this section, we study the one-sided fading Gaussian interference channel model described 
in Section [TT] and prove Theorem [TJ An outer bound for the capacity region is first derived 
by converting the channel to a "layered" model through the use of "incremental channels." 
An achievability result is then developed in analogy to the coding techniques introduced for the 
layered erasure model. The capacity for a few special cases is provided at the end of this section. 

A. The Outer Bound for the Gaussian Model 

Note that the capacity region depends only on the marginal distributions of the channel outputs 
Y and Z conditioned on the channel inputs and states. We can thus replace the joint distribution 
by any distribution compatible with identical conditional marginals, so that the capacity region 
is preserved. Throughout Section IVI-A[ we assume alignment of the links as follows without 
changing the capacity region. First, because the phases are known and can be compensated at 
the receivers, we can assume 6o m = ©2m = for all m = 1, . . . , n without loss of generality. 

Secondly, we assume that the fading states of the direct link for user 2 and the interference 
link are aligned, i.e., for every m — 1, . . . , n, their SNRs are driven by the same random variable 

Som — ^(^m)! and S 2m = F^A™) , 

where {A m } are i.i.d. and uniformly distributed on [0, 1]. The states {Si m } remain independent 
of {S 0m , S 2m }. It is important to note that the region 1Z remains the same, because the bounds 
in © are invariant to the dependence of Sq and S 2 introduced here. 

Third, we assume that the additive noises at the two receivers are also aligned. This is 
easy because Gaussian noise is infinitely divisible. Let {B m (y),v > 0}, m — 1, ...,n be n 
independent CSCG continuous -time processes, each of which is of independent increments with 
E{|-B m (z/)| 2 } = v. Basically B m is a complex-valued Brownian motion. Without changing the 
capacity region, we can simplify the model (OQ) to the following: 

Y m = X m + ^ e jQm W m + B rn (±- 



where m— 1, . . . , n and 6 m are i.i.d. uniform on [0, 2ir]. 
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Fig. 4. An illustration of noise alignment via incremental channel. The signal X is corrupted by a circularly symmetric standard 
complex Brownian motion. Z, Z, and Y are generated by taking the corrupted signal out at time I/S2, 1/So, and (Si + l)/So, 
respectively. 



For convenience, let S = (So, S\, S2) and = (Oo,Oi,02). The notation for the time 
index in this section is different than that used for the layered erasure model in Sections [TV] 
and |VJ In general X m refers to a signal at time interval m, and X n refers to the signal over 
n time intervals, (Jfi, . . . , X n ). Moreover, let S n = {Si[j} \ % — 0, 1, 2 and j = 1,2,..., and 
© n = {©4j1 I i = 0, 1, 2 and j = 1, 2, . . . , n}. 

By Fano's inequality, the rates of the two users must satisfy 



nRx - n5 n < l(W n ; Y n \S n , @ n ) 
nR 2 -n5 n <l(X n ;Z n \S n ) . 

For convenience, let us introduce signals Z and Y as follows: 

1 



(52) 
(53) 



X + B 



Sf) 



Si 




Y = X + \I^W + B 



1 

So 



X + B 



5*0 



where we have implicitly defined W as a unit CSCG random variable, which is proportional to 
the increment of the Brownian motion, and is hence independent of X and the additive noise 
B(1/Sq). By [26, Corollary 2], setting the distribution of the input W to unit CSCG incurs no 
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more than 1 bit of loss in the mutual information on the RHS of (l52l). That is 



S n )+n. 



nRi-nS n <XiW ;Y 
Because W — (Y — X) — Y is a Markov chain, 



nRx — n5 n -n<X [W ; Y \S n 



/ . — n ^ n 
X[W ,Y -X n 



nE log (1 + 51) - X [Z ;X n 



S n ) - l( W n : Y n - X n 



Y\S n \ . 



Y ,S r 



Meanwhile, the bound (l53l) on the rate of user 2 becomes 



nR 2 -n8 n < l(X n ;Z n \S n 



< 1 i^X n ; Z n , Y 
= I [X n ; Y 

< nElog ( 1 + 



S n j + J ( X n ; Z n 



Y ,S r 
+ 1 [X n ;Z n \Y~,S r 



(54) 



(55) 



We have the following result, which is a parallel of Lemma [2l 

Lemma 3: Suppose a collection of random variables T is independent of S = (So, Si, S2). 
Then 

l[X;Z Y,T,S^j =\oge J ^ a( 7 )mmse [X| 7 , T] d 7 (56) 

i(X;z\y,T,s) =loge^ 00 /%)mmse[X|7,T]d 7 (57) 

where a( 7 ) and /3( 7 ) are given in © and ©, respectively, and mmse [X| 7 ,T] is defined as 

mmse [X| 7 ,T] =E[X -E[X\^X + U,T}} 2 . 
The proof of Lemma [3] is given in Appendix [H] and a similar interpretation of the result as the 
one for Lemma [2] can be obtained. In order to establish the third constraint in © in Theorem [H 
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we consider the weighted difference between the two mutual informations in (1541) and (1551) : 



ujI X n : Z r 



Y \S n ) -1 X n :Z 



Y ,5 n 



1 X n : Z r ' 



Y ,S n ) -1 X n :Z 



Y n , 5 n ) 1)1 (X n ; Z r 



9 n ,sr 



J2 \l (X n ; Z m , Z n m+1 



m=l 



Y'\S n ) - llX n :Z m+ \Z 



n 



— n 

Y ,S r 



+ {uj-l)l[X n -Z n 



Y\S' 



(58) 



by "Marton-like" expansion as for the layered erasure model. By the chain rule, the RHS of (|58l ) 
can be reduced to: 



n 



Z m ] x n 



Z m -\Z n m+x ,Y n ,S n ) -Z[Z m -X n \Z m - x 

n 

+ (a;-l)^j(z m ;X" 



m=l 



~ n ~ n _ 
7. V <? n 
5 ^m+l> z ' 



< 



n 



^mj X™ 



m=l 



m=l 



rm— 1 



„ 1 ~ n ~ n 

Zm— 1 ry \r c*n 



+ {u-l)Y^l{z m] X 

m=l 

' S }^ J ^1 {z m ]X m Z m 1 ,Z m+1 ,Y , S n ^j — l(z m ;X m Z m 1 ,Z m+1 ,Y , S n 



(59) 
(60) 



m=l 



where (|59l is due to the fact that Z m — X n — Z m+1 is Markovian and u < 1, and (|60l) is because 
that Z m — X m — (X m_1 , X™ +1 ) and Z m —X m —(X m -\ X n m +1 ) are both Markovian. Finally, for 
each m, we apply Lemma [3] with 

/ ™ i ~w ~ m— 1 ~n „ -, 

J- m — I ^ j ^m+li 1 > r m+1' ' J m+1 

which is independent of («So m , Si m , S , 2 TO )» to obtain 



Since 



F ,S n ) —X[Z ;X n 



) n poo 
< J^loge / (w/3( 7 ) - Qt(7)) mmse [X m |T m ] d 7 . 

m=l 70 



mmse[X m |7,T m ] < — — V7 
1 + 7 
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R2 A 




Elog(l + 5i) + l Ri 



Fig. 5. An illustration of the inner and outer bounds for capacity region. The outer bound is drawn in solid line; the inner 
bound is drawn in dashed line. Note that we drop the first constraint in the ® so that the Ri < Elog (1 + Si) + 1. 



we have 



Y\S n ) -1(2 r-x n 



Y ,S n )<n (u,/3( 7 ) - aft))" 



loge 
1 + 7 



dj. (61) 



Comparing (l54l) . (1551) and (I6TI) and noting that 5 n — > as n — > oo, we have established the third 
constraint in ([3]). 



B. The Inner Bound: A Constant Gap Result 

We propose a coding scheme which achieves a rate region within a constant gap to the outer 
bound developed in Section IVI-Al The gap applies to all SNR and fading statistics. Thus the 
inner and outer bounds are asymptotically tight at high SNRs, where the capacity becomes 
large. The coding scheme is inspired by the coding scheme used for fading broadcast channels 
developed by Tse and Yates [17]. 
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To make the analysis easier, we drop the first constraint in © and the new region is still 
an outer bound, denoted by 1Z . Note that the third bound at u> = corresponds to Ri < 
Elog (1 + Si) + 1, which is looser than the first constraint in ©, but within 1 bit. 

Similar to the capacity region for the layered erasure model, besides the two axes, the outer 
bound 7l' is enclosed by two curves, which correspond the remaining two constraints: One is 
line R 2 = Elog (1 + 5*2); the other curve C is the boundary of the region r\ue[o,i]H(u), where 

5*0 



H(u) = < (R^R 



R 1 +luR 2 < 1 +Elog(l + Si) +c;Elog 1 + 



1 + Si 



+ / (u/3(r/)-a(rf))&y 



and 

B(u) = {7 e [0,00)^(7) >a(j)}. 

We claim that for every u G [0, 1], the straight line boundary of H(u) touches the curve C at 
the point 

= fl + Elog(l + S 1 )- / e*( 7 )d 7 , Elog(l + -^-) + / /%)d 7 Y (62) 
V Jb{w) \ + Jb{u) J 

To see this, first, note that the point is on the boundary of H(u) because it achieves the equality 
of the constraint H(cu). Moreover, for every uj' 6 [0, 1], we have 

R 1 {uj) + uj'R 2 {uj) = l+E\og{l + S 1 )+u'Elog(l + — \-) + [ (u'^-y) - a( 7 ))d 7 

V 1 + 01/ 

< l + Elog(l + S 1 )+^Elogfl + -^-) + / (^/3( 7 )-a( 7 ))d 7 

V 1 + ^1/ Jb(u>') 

where the last step is due to the fact that u//3( 7 ) — a ( 7 ) < for every 7 e B(uj)\B{uj'). Thus 
(fli(w),^j(u;)) G fl'(a; / ), which proves the claim. 

Denote points (i?i(l), i? 2 (l)) and (i2i(0), i2 2 (0)) by L fe and L Q , respectively. Generally, the 
curve C can be divided into three parts: The part on the left side of L b is a ray with slope -1; 
the part between L b and L has tangent line with slope steeper than -1; The remaining part is 
a vertically downward ray starting from point L . Another observation is that all the extreme 



points are contained in the closure set M. = { (i?i(u;), R 2 (uj)) \u> G [0,1]}. However, not all 
points between L b and L are contained in the set M. For example, when S , Si, S 2 are all 
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discrete random variables, the outer bound 1Z becomes a polyhedron, like the case of layered 
erasure model. 

The line R 2 = Elog (1 + S 2 ) can intersect with curve C in various locations and we refer to 
the intersection as the top maximum sum-rate point as in the case of the layered erasure model. 
In the following, we first show that every point in M. and below line R 2 = Elog(l + S 2 ) is 
achievable within a constant gap, and then deal with the top maximum sum-rate point. 

The points in M. can either be parametrized with uj as (1621) or be asymptotically approached 
by those can be parametrized with to. Therefore, it suffices to show the achievability result for 
those can be parametrized. Note that P (5*2 > 7) > P (So > 7) whenever 7 G B(uj). Thus, the 
coordinate (1621) can be rewritten as 

S 1 (w) = l+Elog(l + 5 1 )+ / P f 7T~T > l \ ~r~~^l — I P(5 >7)T^d7 

Jb(u>) \Si + l ) 1 + 7 Jb{w) 1 + 7 

(63a) 

M»)=f P f T^T > 7) ^ld 7 + / P(5 2 > 7 )^d 7 . (63b) 



Jb-(uj) \Si + l J 1+7 Jb(u) 1+7 
In following, we show that the rate pair (Ri(u) — A*, R 2 (uj) — A*) can be achieved for some 
universal constant A*. 

Let the transmitter 1 generate a random codebook with i.i.d. unit CSCG distribution. Let the 
signaling of user 2 follow the distribution of X, which is constructed as follows 

x = vf E ^ + vf E ^ 2_i ( 64) 
v i=i v 1=1 

where {X/jjf 3 and {Xqj}^ are independent signals taking ±1 equally likely. Let Xjy denote 
a/3/2 Y.ieM (Xn + 3 x Qi) ■ Fix p > to be a constant, and let p n = p2 2(n_1) , n = 1, 2, .... For 

G [0, 1], let AT(cj) = {n > 1 : p n G £(cj)} and A/" c (c;) = {1,2,... }\AT(uj). The transmitted 
signal of user 2 is the sum of two codewords, X[m] = Xj^^[m] + X^c^[m], m = 1, . . . ,n, 
which carry the common message and the private message respectively. The codebooks for the 
common message and the private message are randomly generated using the distributions of 
X/jc( u ) and respectively. 

Receiver 1 first decodes the common message of user 2 and then decodes its own message. 
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Therefore user 2 can achieve rate 

R 2 {uo) = min (z [y/s[eP Ql W + \fS~ e j@0 X + U; X M < 



Si, Sq, 0i, Go 



+1 (^S~ 2 e^X + V; X MH \s 2 , 6 2 ) , J (v^e^X + 7; x|s 2 , 6 2 )) 



min (l (^/Slw + + U; X mw) 



Si, Sq 



+l(^S~ 2 X + V;X N{u)) 



S-: 



2 > 



5, 



(65) 



where the phase random variables are removed in (1651) . The reason is following: Q 1 is absorbed 
by since it is CSCG; 6 and 2 can be compensated at receivers since receivers know 
each realization of them. Similarly, we can remove the phase random variables in the remaining 
development. After removing the common message, receiver 1 can decode its own message at 
rate 



Ri(u) = 1 (ysle^W + v^e^'X + U; w\x mu) , S h S , &i, 8 ) 



J (ys[w + v^%) + u-w 



Si, S{ 



Si 



- 1 (yFiW + U; W | ^/S~iW + v^%) + U, So, Si) (66) 
Elog (1 + 5i) — J (^/SoX N{u3) + U; X muj) \^S[w + y/S~ X„ {u) + U, S , S^ 
Elog (1 + S 1 )-1 (y/3^X mu) + U; X M{uj) \s ) 

Si, Sq 



+ 1 (\f~S~iW + y/SoXtf^ + U ; X^u,) 



(67) 



where d66]) and $67} are due to the fact that W— {^/S~iW + C/)— W + x/SoX^ + U) and 
X^ — ( v^o Xj\f + U)— (VS[W + ^S~oX M + U) are Markovian. 

Now, we compare (Ri(u), R 2 (u)) with (Ri(u), R 2 (u))- By (I63al) and (1671) . 

Si(w) 

= 1 + 



+ 
< 1 + 



X (</Sb *atm + f/; Xat (w) 5 ) - / P (S > 7) 
. v J Jb{lo) 1 + 7 . 

Jb(u>) \Si + l J 1 + 7 ^ 



*Si, *So 
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+ 



J (v^b^ATM + U- X MH S )- [ P (5 > 7) 

. V ) Jb( u ) 1 + 7 . 



(68) 



where we replace X^o/) with X in the first mutual information due to the fact that X^ u ) and 
X\t c (uj) are mutually independent. 
We have following crucial lemma: 

Lemma 4: Let Z = \/T X + U, where Y is an arbitrary non-negative random variable. Let B 
be a measurable subset of [0, oo). We define M = {n > l\p n E B}. The following inequalities 
hold 



and 



/ P(r>7)^d7-A(p)<X(Z;X^|r) 
Jb 1+7 

l(^/fX N + U-X N T) < / P(r>7)-^d 7 + A(p)-C 
v 1 Jb 1+7 



(69) 



(70) 



where A(p) is independent of T, and C = log (ne/6) + 1 ~ 1.546 bit. Furthermore, A(p) can 
be minimized by choosing p = 5.65, yielding A < 6.386 bit. 

Lemma @] is proved in Appendix [TTTJ We note that inequality (l69l) has basically been shown 
in [17]. 

Applying Lemma @] to (l68~l) . we have 

Ri(u) - R^u) < 1 + A(p) + A(p) - C 
< 2A(p) . 

By ([651), 

= max (ifcH - X (v 7 ^^ + v^X + U; X mu) 1 5 ) - J (v 7 ^ + V\ X M(w) S, 

R 2 {uo)-l{yS~ 2 X + V;X\s, 
By (I63bl) . the first term in (1711) can be written as 

/ P (Vt7^ ^^-l(y%W + y%X + U;X^ 



(71) 



01, 5c 



/ P (S 2 > 7) - X ( v^X + V; X mu) 

Jb(lu) 1 + 7 v 



<A(p) + A(p) 



5, 



(72) 
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5*2 ) <log(y)+l. (73) 



by Lemma |H Meanwhile, since X is uniform, by [26, Eq. (10)], 

Elog(l + S 2 ) -l(y%X + V;x|s 2 ) < log(y) +1. 
By assumption, R 2 (co) is below Elog(l + S2), so that 

R 2 {uj) - l(^fS~ 2 X + V;X 
Putting CD}, d) and (ED together, we obtain 

R 2 {u) - R 2 {u) < max (2A(p), log ) + l) . 
By minimizing A(p) over p > 0, we obtain 

fli(a;) - < 12.772, i = 1, 2. 

Next we deal with the top maximum- sum-rate point. There are three cases: 

Case 1: The top maximum- sum-rate point is below Lq. In this case, Elog(l + S'2) < Elog (l + 
5*0/(1 + 5i)), i.e., the outer bound becomes a rectangle. Let both users use i.i.d. Gaussian 
signaling. Then the two users can achieve rates Elog (1 + Si) and Elog (1 + 52), respectively. 
Indeed, the condition Elog(l + 5 2 ) < E log (1 + 5 /(l + Si)) guarantees that receiver 1 can 
remove the signal of user 2 completely by treating its own signal as noise. Therefore, the gap 
between achievable scheme and outer bound is at most 1 bit for user 1 in this case. In fact, for 
this case, < Ri < Elog (1 + 5j) (i = 1, 2), is exactly the capacity region. 

Case 2: The top maximum- sum-rate point is between L and L b . If the top maximum- sum-rate 
point is in set AL then preceding analysis already covers this case. But it is possible that the top 
maximum- sum-rate point is between L and Lb and it is not in set M.. In other words, the part 
of curve C can be a line segment with slope steeper than —1 and the top maximum- sum-rate 
point is right on this segment. This situation can happen, for example, when 5 , Si and 5 2 are 
all discrete random variables. 

Denote the two ends of the line segment by L u and L d , where L d is below the top maximum- 
sum-rate point. Suppose slope of the line segment is —l/u e , then L u 's coordinate is (Ri(u> e ), R 2 (tu e )) 
and the coordinate of Ld is 



1 + Elog (1 + 50- / a( T )d 7 , Elog(l + -^-) + / PMd-y) 

JB°(u) e ) V l-T^l/ jB°(u> e ) J 



(74) 
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where B°(uj) = {7 G [0, oo)\uj(5{^) > a^)}. For notational convenience, we define B'(uj) = 
B(u))\B°(lu). Since both points are extreme points, they belong to M. By preceding analysis, 
point L d can be achieved up to a gap of 12.772 for each user. Therefore, it is sufficient to show 
the achievability result for the top maximum- sum-rate point because points between L d and the 
top maximum- sum-rate point can be shown by time- sharing argument. The point on the segment 
can be parametrized as 

= fl+Etog(l + 50- / a( 7 )d 7 , Elog(l + -^-)+ / ftW&y) 
V Ja(s) \ + j 

where 

A(5)=B°(uj e )U(B'(uj e )n[0,5)y 

It is easy to see that when 5 varies from to 00, (R 1 (S),R 2 {S)) moves continuously from L d 
to L u . Since the maximum- sum-rate point is between L u and L d , it must be (R(S*), R 2 (S*)) 
for some 5*. Now, let J\f* = {n > l\p n e A(S*)}, where p n = p2 2( - n ~ 1 \ Let user 1 generate 
its codebook according to i.i.d. unit CSCG distribution. Let user 2 generate the codebook for 
private message using the distribution of Xj^* and generate the codebook for common message 
using the distribution of Xj^*c Following the exactly same analysis as we did for the points in 
M, we see that the top maximum- sum-rate point can be achieved up to a gap of 12.772 for 
each user. 

Case 3: The top maximum- sum-rate point is above L b . In this case, the achievability result 
for L b holds by preceding analysis. That is following rate pair is achievable and it is at most 
12.772 away from the critical point for each user: 

i?i(l) = X {^S[W + v^X + U; w\x mi)} S 1} So) 

R 2 (l)=l(^ 1 W + ^ X + U-,X mi) \s 1 ,S S ) +1 (^S~ 2 X + V;X mi) \s 2 ) . 

Since the outer bound between top maximum- sum-rate point and L b is a segment with slope -1, 
it suffices to investigate the achievability of points along the ray with slope -1 and starting from 
(i?i(l), R 2 {1)) till the intersection with the other constraint R 2 = Elog (1 + S 2 ). 

As in the study of layered erasure model, we split user 1 into two virtual users [27]. Let W\ and 
W 2 be two independent CSCG random variables with W x ~ CAf(0, 5) and W 2 ~ CAf(0, 1-5), 
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where 5 G [0, 1]. Let user 1 generate the codebooks for its two virtual users using the distribution 
of W\ and W 2 , respectively. Receiver 1 first decode U, then decode X^c and finally decode V 
in order. Therefore, following three rates are achievable at receiver 1 

So, S] 



R 1>1 (5)=l(yK(W 1 + W 2 ) + Vs~oX+U-,U 



R 2c (S) = 1 (y/Sl{W x + W 2 ) + ^S~ X + U; X mi) 



w 1 ,s ,s 1 



RiM =l( y v^S\(W l + W 2 ) + ^X + U;V 
Receiver 2 can achieve the private rate 

R 2p (S) = l(^S~ 2 X + V-X M{1) 



^Af c (l), Wi, S , Si ) . 



By the chain rule, we have R ltl (6) + Ri ;2 (5) + #2c0) + R 2p (5) = #i(l) + R 2 (l). Therefore, 
we show that (Rn(5) + Ri i2 (5) , R 2c (S) + R 2p (8)) is on the ray with slope -1 and starting 
from (i?i(l), R 2 (l)). That is the gap between the segment (Ri ; i(8) + Ri j2 (5), R 2c (5) + R 2p (5)), 
5 e [0, 1], and the boundary of H(l) is at most 12.772. 

Furthermore, we need to show that R 2c (8) + R 2p (5) can go sufficiently close to line R 2 = 
Elog(l + S 2 ). Note that R 2c ($) + R 2p (S) increases as 5 increases, since larger 5 implies that 
larger part of user l's signal is removed before decoding the common message at receiver 1. 
Letting 5=1, 

i? 2c (l) + i22p(l) = X ( VS~ X + V; X mi) | So) + 1 (^S~ 2 X + V; X m) 
By Lemma |H 

loge 

r\pod.1) — — 07+/ r(D 2 ^7J 

c (i) 



R2c(X) + R2 P (l) + 2A(p) > f P (So > 7) T^d 7 + / P (S 2 > 7 ) f^-d- 

J BHD 1 + 7 Jb(i) 



1 + 7 



> 



P (S 2 > 7) 



1 + 7 



Elog(l + S 2 ) 



where the last inequality is due to the fact P (So > 7) > P (5*2 > 7) whenever 7 6 B c (l). This 
shows that we can choose 8 such that the top maximum- sum-rate point can be achieved up to 
12.772 for each user if the top maximum- sum-rate point is above L^. This completes the proof 
of Theorem [IJ 
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C. Capacity Results for Some Special Cases 

In the remainder of this section, we establish the capacity region for a few special cases which 
is not implied by Theorem [TJ 

Theorem 3: In case of one-sided stochastically strong interference, i.e., P (5*0 > 7) > P (S 2 > 7) 
for all 7 > 0, the capacity region is 



< Ri < Elog(l + 
(Ri,R 2 ): 0< i? 2 <Elog(l + S 2 ) 



(75) 



R 1 + R 2 <E\og(l + S 1 + S 2 ) ^ 
Theorem |3] directly follows the capacity results for general strong interference channel estab- 
lished in [24]. 

Theorem 4: In case of one-sided stochastically weak interference, i.e., P (So > 7) < P (S 2 > 7) 
for all 7 > 0, the sum-capacity of channel © is given by 

C sum = E log (l + — p—] + E log (1 + S 2 ) (76) 



V 1 + S 0/ 

The proof of Theorem @] is relegated to Appendix [IV] By ([76]), to achieve the sum-capacity, 
we need to let user 2 transmit at its full rate. Intuitively, the rate gain of the user 2 is always 
larger than the rate loss of user 1 . In the proof of converse, we show that it is optimal for both 
users to use Gaussian signaling. This does not lead to the extra 1 bit compensation on user l's 
rate as we did in proof of Theorem \T\ Therefore, we can establish a tighter result in this case. 
It remains to see whether the extra 1 bit in the third constraint of ([3]) can be removed in the 
general cases. 

VII. Conclusion 

This work derives the first constant-gap result for the capacity region of the ergodic one-sided 
fading Gaussian interference channel with channel state information at the receiver but not at 
the transmitters. To achieve this, the new outer bound is obtained via investigating the trade-off 
between rate gain and rate loss of the two users. The achievability strategy is constructed by 
artificially layering one transmit signal. Both of the outer and inner bounds are motivated by the 
simpler and exact results for the corresponding layered erasure model. 



November 5, 2009 



DRAFT 



41 



Appendix I 
Proof of Lemma [2] 

Because signals are aligned at their respective least significant bit in a sum, one can write 
H (x?° |xf° © Wi l ,T, iv) 

j2J2p(n 1 = m) p (n = n ) n (xr |xS n °- ni)+ , T) 



rii=0 no=0 
9 9 9 



EEE^^^"^ = l((n -n 1 ) + <Kno)^ \ T ) ( 77 ) 

rii=0 rto=0 i=l 

9 

P (^o - ^ < / < N ) H {XtlX 1 - 1 , T) 



i=i 

9 



{=i 

where (1771) is due to the chain rule. Hence the proof of (|25T) . Similarly, 

n (xf 2 © w^ 1 , t, at) 



X(n -ni)+ 

1 ! 



^ P (JVi = m) P (No = n , N 2 = n 2 ) H (x^ o _ ni)+ 

ni, no, 712 

9 

P(Ni = m) P (JV = n , iV 2 = n 2 ) ^ l ((no -«i)+<K»«)W (Xz|*i~\ T) 

711,no,712 i=l 
9 

53 P (N - Ni <l < N 2 ) H (XtlXt 1 , T) 



i=i 

Furthermore, 



P (JV - iVi < Z < N 2 ) 

= P (JV 2 > - P (A^o - #i > Z, iV 2 > 

9 

= ^ P (JVi = m) (P (iV 2 > - P (N - m > I, N 2 >l)) 



ni=0 
9 



^ P (iVi = m) (P (iV 2 > Z) - min(P (JV > m + Z) , P (iV 2 > /))) (78) 



711=0 



E [P (JV 2 > Z) - P (A^o - Nt > l\Ni)\ 

A 
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where (1781) is due to the alignment between N and N 2 . Hence the proof of (1261) . 



Appendix II 
Proof of Lemma [3] 

Since X — Z — Y is Markovian, we have 



1\Z;X 



1 Z:X 



S,T) -1(Y;X 

S 



S,T 



rso /-S /(Si+l) 

logeE / mmse[X|7,T]d7-logeE / mmse [X\j, T] d-f (79) 

Jo Jo 



logeE / mmse [X|7,T] d7 

JSo/(Si+l) 

POO 

loge / El ( 5 o/(5l+ i )<7 < 5o) mmse[X|7,T]d7 
Jo 

So 



logc /„ P V s, , 



<1 <S Q ) mmse[X|7,T] d7 



POO 

= loge / a(7)mmse [X|7, T] d7 
Jo 

where (1791 is obtained by using the integral representation of mutual information via MMSE [28]. 
Hence the proof of (|56l) . 

Note that for any realization of S, either X — Z — Y or X — Y — Z is Markovian. Furthermore, 
when it is the latter one, the mutual information is zero. Therefore, for every realization s of 
the states, 



1 Z:X 



Y,S = s,T) = (J (Z;X\S = s,T) - I[Y;X 



S = s,T 



Thus, 



1 Z:X 



Y,S,T 



P Sl (dsi) J P A (dA) (I (Z;X\S = s,T) -1 (Y,X S = s,T 

' rS 2 rSa/iSt+l) 

: logeE / mmse [X|7,T] d7 - / mmse [X\^y, T] d'j 

Jo Jo 

poo 

= logeE / l (5o /(5 1 +i)<7<5 2 )mmse[X|7,T]d7 
Jo 

POO 

loge / P(5 /(Si + l) <7< 5 2 )mmse[X| 7 ,T]d 7 . 
Jo 
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Furthermore, 

P (S /(S l + 1) < 7 <S 2 ) 
= P (5 2 > 7 ) - P (So/ (Si + 1) > 7, S 2 > 7) 

= P (5 2 > 7) - Emin (P (S /(S X + 1) > 7 |5 a ) , P (S 2 > 7 )) (80) 
= E [P (5 2 > 7) - P (Sb/OSi + 1) > 7l^i)] + . 

= P{i) 

where (f8~0l) is due to the alignment between So and S 2 . Hence the proof of (1571) . 

Appendix III 
Proof of Lemma [4] 

Instead of computing the mutual information directly, the authors of [17] show that the system 
with input Xjy and output Z has an achievable rate at least the amount of RHS of (|69| ). Hence 
(l69l) holds by coding theorem of point-to-point system [21]. 

To prove (f70l> . we apply ([69]) to the sets 23 c and A/" c , where A/" c = {1, 2, . . . }\Af, 

[ P(r>7)T^ci7-A(p)<j(Z;X^|r,e') . 

It can be rewritten as 

Elog(l + r)- / P(r> 7 ) T ^d7-A(p) 

1 + 7 

< I{Z;X\T) -l(Z;X Af \X Afa ,T) 

= j (z ; x|r) - j (Vf x M + x^|r) . (8i) 

Note that X has a uniform distribution on unit square. By [26, Eq. (10)], we have 

Elog(l + r)-Z(Z;X|r) <log(y) +1. (82) 
Comparing (|8TT) and (|82l) , we have established (fTOl ). 
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Appendix IV 
Proof of Theorem @] 

For the achievability, we let both users generate the codebook with unit CSCG distribution. 
Assign rate Elog (1 + S 2 ) to user 2. Receiver 1 decodes its own message by treating the signal 
from user 2 as noise. Therefore, following rate pair is achievable 

Si 



R 2 = Elog(l + S 2 ) 

which, in turn, achieves the sum-capacity. 

For the converse, without changing the capacity region, we assume without loss of generality 
that Sq and S 2 are driven by the same uniform random variable A and O = 02 = 0. Note that 
with this modification, we have S < S 2 . By Fano's inequality 

nR 1 +nR 2 - nS n < 1 (Y n ; W n \S n ,@ n ) + 1 (Z n ; X n \S n , & n ) 

<l(Y n ;W n ,X n \S n , 0") - 1 ({Y n ; X n \W n , S n , n ) 

+ l(Z n ;X n \S n ,@ n ) 

< log (1 + 5Q + 50 - l({ v / S ~X + U} n ;X n \s , { 



(83) 



+ i({^s~ 2 x + uy-x n \s 

Using "Marton-like" expansion as we did for the outer bound of Gaussian model (see the 
development of (l60l)). rewrite the last two terms as 



J ({ y^X + U} n ; X n 5 n ) - 1 ({ y^X + U} n ; X n 



n 

J2{z(V&iXi + Ui;X n 

i=l 



Ti, S 2 i ) — X 



+ Ui] X n Ti, S i^ I 



where T z = {{VS^X + UY~\ {y%X + U}? +1 , S'' 1 , fif? +1 ). 

Note that (X^ 1 , X? +1 ) — Xi — (y/S^Xi + Ui, i/Soi Xi + U) is Markovian. Therefore the 
difference between the two mutual informations can be further rewritten as 



T,,S 



1=1 

= J2 E mmse[X i |7,T 4 ]d 7 

i=i J s ° 



X ( v Soi Xi + Ui: X~ 



Ti, Soi 



)} 
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where we use the integral representation of mutual information via MMSE [28]. Moreover, 



mmse [Xi\j,Ti\ < —J— V 7 > 0. 

1+7 



Hence, we can establish that 



X ({ t/S~ 2 X + U} n ; X n 5 n ) - X ({ t/S~ X + U} n ; X n 

rs 2 1 

< nE / d7 

~ Jso 1+7 

= nE log (1 + S 2 ) - nE log (1 + S Q ) . 

Comparing ([83) and ([841) yields 

nRx + nR 2 - n5 n < nElog (1 + S Q + Si) + nElog (1 + S 2 ) - nE log (1 + S ) . 

Letting n — » oo, we have 

R! + R 2 < Elog (1 + S + SJ + Elog (1 + 5 2 ) - Elog (1 + S ) 

Si 



(84) 



Elog 1 



l + S 



Elog(l + 5 2 ^ 



which completes the proof. 
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